Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 134201 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 17.4 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Text | 1 |
|---|---|
| Categorical | 3 |
| Numeric | 13 |
Alerts
| Alert | Type |
|---|---|
| feature4 is highly overall correlated with feature6 and 1 other field | High correlation |
| feature5 is highly overall correlated with feature8 | High correlation |
| feature6 is highly overall correlated with feature4 and 1 other field | High correlation |
| feature8 is highly overall correlated with feature5 | High correlation |
| feature9 is highly overall correlated with feature4 and 1 other field | High correlation |
| feature15 is highly overall correlated with label | High correlation |
| feature2 is highly overall correlated with feature13 | High correlation |
| feature13 is highly overall correlated with feature2 | High correlation |
| label is highly overall correlated with feature15 | High correlation |
| label is highly imbalanced (66.0%) | Imbalance |
| feature5 has unique values | Unique |
| feature6 has unique values | Unique |
| feature10 has unique values | Unique |
| feature14 has unique values | Unique |
| feature15 has unique values | Unique |
| feature16 has unique values | Unique |
| feature11 has 4866 (3.6%) zeros | Zeros |
| feature12 has 13139 (9.8%) zeros | Zeros |
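The 66.0% imbalance reported for label is consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below assumes that definition, which this report does not state, and uses the label counts listed under Common Values (125704 zeros, 8497 ones). The helper name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares.

    Assumed definition: 0.0 means perfectly balanced classes, values near 1.0
    mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts for `label`, copied from this report.
label_counts = [125704, 8497]
score = imbalance_score(label_counts)
print(f"{score:.1%}")  # compare with the 66.0% shown in the alert
```

Under this assumed definition a 50/50 split scores 0, so the alert reflects how far the 93.7% / 6.3% split is from balance rather than the raw minority share.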
Reproduction
| Analysis started | 2023-06-18 17:39:44.776172 |
|---|---|
| Analysis finished | 2023-06-18 17:40:19.994398 |
| Duration | 35.22 seconds |
| Software version | ydata-profiling v4.2.0 |
| Download configuration | config.json |
feature1
Text
| Distinct | 481 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 37 |
| Mean length | 21.749599 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2918818 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Site engineer |
|---|---|
| 2nd row | Site engineer |
| 3rd row | Site engineer |
| 4th row | Site engineer |
| 5th row | Site engineer |
| Value | Count | Frequency (%) |
| officer | 13339 | 4.1% |
| engineer | 13231 | 4.1% |
| manager | 9818 | 3.1% |
| and | 6354 | 2.0% |
| scientist | 5433 | 1.7% |
| surveyor | 5413 | 1.7% |
| civil | 3865 | 1.2% |
| health | 3858 | 1.2% |
| therapist | 3854 | 1.2% |
| psychologist | 3796 | 1.2% |
| Other values (450) | 252639 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 300566 | 10.3% |
| i | 255305 | 8.7% |
| r | 236654 | 8.1% |
| a | 208071 | 7.1% |
| n | 203662 | 7.0% |
| t | 198736 | 6.8% |
| (space) | 187399 | 6.4% |
| o | 170179 | 5.8% |
| s | 151529 | 5.2% |
| c | 140807 | 4.8% |
| Other values (43) | 865910 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2526944 | |
| Space Separator | 187399 | 6.4% |
| Uppercase Letter | 141788 | 4.9% |
| Other Punctuation | 57355 | 2.0% |
| Open Punctuation | 2666 | 0.1% |
| Close Punctuation | 2666 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 300566 | |
| i | 255305 | |
| r | 236654 | |
| a | 208071 | 8.2% |
| n | 203662 | 8.1% |
| t | 198736 | 7.9% |
| o | 170179 | 6.7% |
| s | 151529 | 6.0% |
| c | 140807 | 5.6% |
| l | 115857 | 4.6% |
| Other values (16) | 545578 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 15300 | |
| E | 14996 | |
| S | 14717 | |
| P | 13278 | |
| T | 12323 | 8.7% |
| A | 11292 | 8.0% |
| H | 7456 | 5.3% |
| M | 7092 | 5.0% |
| R | 6444 | 4.5% |
| I | 6009 | 4.2% |
| Other values (11) | 32881 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 40884 | |
| / | 15561 | 27.1% |
| ' | 910 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 187399 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2666 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2666 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2668732 | |
| Common | 250086 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 300566 | |
| i | 255305 | 9.6% |
| r | 236654 | 8.9% |
| a | 208071 | 7.8% |
| n | 203662 | 7.6% |
| t | 198736 | 7.4% |
| o | 170179 | 6.4% |
| s | 151529 | 5.7% |
| c | 140807 | 5.3% |
| l | 115857 | 4.3% |
| Other values (37) | 687366 |
Common
| Value | Count | Frequency (%) |
| (space) | 187399 | |
| , | 40884 | 16.3% |
| / | 15561 | 6.2% |
| ( | 2666 | 1.1% |
| ) | 2666 | 1.1% |
| ' | 910 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2918818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 300566 | 10.3% |
| i | 255305 | 8.7% |
| r | 236654 | 8.1% |
| a | 208071 | 7.1% |
| n | 203662 | 7.0% |
| t | 198736 | 6.8% |
| (space) | 187399 | 6.4% |
| o | 170179 | 5.8% |
| s | 151529 | 5.2% |
| c | 140807 | 4.8% |
| Other values (43) | 865910 |
feature2
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 10.47025 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1405118 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | grocery_pos |
|---|---|
| 2nd row | gas_transport |
| 3rd row | grocery_pos |
| 4th row | shopping_net |
| 5th row | health_fitness |
Common Values
| Value | Count | Frequency (%) |
| grocery_pos | 14217 | |
| shopping_pos | 13133 | |
| home | 12471 | |
| kids_pets | 11295 | |
| gas_transport | 10898 | |
| shopping_net | 10879 | |
| food_dining | 9633 | 7.2% |
| personal_care | 9526 | 7.1% |
| entertainment | 9171 | 6.8% |
| misc_pos | 8730 | 6.5% |
| Other values (4) | 24248 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 144885 | |
| e | 132966 | |
| o | 132026 | |
| n | 122066 | |
| p | 115823 | 8.2% |
| _ | 108283 | 7.7% |
| t | 103466 | 7.4% |
| r | 93841 | 6.7% |
| i | 86890 | 6.2% |
| g | 64316 | 4.6% |
| Other values (10) | 300556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1296835 | |
| Connector Punctuation | 108283 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 144885 | |
| e | 132966 | |
| o | 132026 | |
| n | 122066 | |
| p | 115823 | |
| t | 103466 | |
| r | 93841 | 7.2% |
| i | 86890 | 6.7% |
| g | 64316 | 5.0% |
| a | 62030 | 4.8% |
| Other values (9) | 238526 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 108283 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1296835 | |
| Common | 108283 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 144885 | |
| e | 132966 | |
| o | 132026 | |
| n | 122066 | |
| p | 115823 | |
| t | 103466 | |
| r | 93841 | 7.2% |
| i | 86890 | 6.7% |
| g | 64316 | 5.0% |
| a | 62030 | 4.8% |
| Other values (9) | 238526 |
Common
| Value | Count | Frequency (%) |
| _ | 108283 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1405118 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 144885 | |
| e | 132966 | |
| o | 132026 | |
| n | 122066 | |
| p | 115823 | 8.2% |
| _ | 108283 | 7.7% |
| t | 103466 | 7.4% |
| r | 93841 | 6.7% |
| i | 86890 | 6.2% |
| g | 64316 | 4.6% |
| Other values (10) | 300556 |
feature3
Real number (ℝ)
| Distinct | 28125 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98.995174 |
| Minimum | 1 |
| Maximum | 15861.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 9.9 |
| median | 48.59 |
| Q3 | 90.84 |
| 95-th percentile | 341.82 |
| Maximum | 15861.4 |
| Range | 15860.4 |
| Interquartile range (IQR) | 80.94 |
Descriptive statistics
| Standard deviation | 205.88166 |
|---|---|
| Coefficient of variation (CV) | 2.0797141 |
| Kurtosis | 536.85962 |
| Mean | 98.995174 |
| Median Absolute Deviation (MAD) | 39.15 |
| Skewness | 12.158061 |
| Sum | 13285251 |
| Variance | 42387.259 |
| Monotonicity | Not monotonic |
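Several of the feature3 figures above are derived from others in the same tables: the range from the extremes, the IQR from the quartiles, the CV from the standard deviation and mean, and the variance from the standard deviation. A minimal sketch that recomputes them from the reported values follows. The variable names are illustrative, and small differences from the report are expected because its inputs are rounded.

```python
# Values copied from the feature3 tables in this report.
mean, std = 98.995174, 205.88166
q1, q3 = 9.9, 90.84
minimum, maximum = 1.0, 15861.4
n_rows, total = 134201, 13285251

derived = {
    "range": maximum - minimum,       # reported: 15860.4
    "iqr": q3 - q1,                   # reported: 80.94
    "cv": std / mean,                 # reported: 2.0797141
    "variance": std ** 2,             # reported: 42387.259
    "mean_from_sum": total / n_rows,  # reported mean: 98.995174
}

for name, value in derived.items():
    print(f"{name}: {value:.7g}")
```

The same relations can be used to sanity-check the other numeric variables in the report.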
| Value | Count | Frequency (%) |
| 1.11 | 69 | 0.1% |
| 1.92 | 64 | < 0.1% |
| 1.15 | 64 | < 0.1% |
| 3.06 | 64 | < 0.1% |
| 1.41 | 61 | < 0.1% |
| 1.14 | 61 | < 0.1% |
| 3.54 | 61 | < 0.1% |
| 3.17 | 59 | < 0.1% |
| 1.54 | 59 | < 0.1% |
| 3.95 | 58 | < 0.1% |
| Other values (28115) | 133581 |
| Value | Count | Frequency (%) |
| 1 | 38 | |
| 1.01 | 44 | |
| 1.02 | 39 | |
| 1.03 | 39 | |
| 1.04 | 52 | |
| 1.05 | 48 | |
| 1.06 | 43 | |
| 1.07 | 52 | |
| 1.08 | 54 | |
| 1.09 | 54 |
| Value | Count | Frequency (%) |
| 15861.4 | 1 | |
| 13708 | 1 | |
| 9875.47 | 1 | |
| 9186.99 | 1 | |
| 8192.07 | 1 | |
| 7559.34 | 1 | |
| 6002.32 | 1 | |
| 5252.81 | 1 | |
| 4841.43 | 1 | |
| 4798.5 | 1 |
feature4
Real number (ℝ)
| Distinct | 893 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50721.259 |
| Minimum | 1106 |
| Maximum | 99791 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 1106 |
|---|---|
| 5-th percentile | 7302 |
| Q1 | 28152 |
| median | 46222 |
| Q3 | 78045 |
| 95-th percentile | 94954 |
| Maximum | 99791 |
| Range | 98685 |
| Interquartile range (IQR) | 49893 |
Descriptive statistics
| Standard deviation | 29578.182 |
|---|---|
| Coefficient of variation (CV) | 0.58315158 |
| Kurtosis | -1.3174572 |
| Mean | 50721.259 |
| Median Absolute Deviation (MAD) | 26887 |
| Skewness | 0.07033489 |
| Sum | 6.8068437 × 10^9 |
| Variance | 8.7486887 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85051 | 659 | 0.5% |
| 44053 | 593 | 0.4% |
| 94509 | 590 | 0.4% |
| 93458 | 473 | 0.4% |
| 22193 | 467 | 0.3% |
| 7650 | 367 | 0.3% |
| 32669 | 366 | 0.3% |
| 48095 | 363 | 0.3% |
| 28152 | 361 | 0.3% |
| 56224 | 361 | 0.3% |
| Other values (883) | 129601 |
| Value | Count | Frequency (%) |
| 1106 | 238 | |
| 1431 | 178 | |
| 1550 | 286 | |
| 1570 | 354 | |
| 1609 | 179 | |
| 1701 | 184 | |
| 1760 | 178 | |
| 1880 | 177 | |
| 1902 | 180 | |
| 2081 | 125 | 0.1% |
| Value | Count | Frequency (%) |
| 99791 | 233 | |
| 99737 | 234 | |
| 99709 | 7 | < 0.1% |
| 99654 | 178 | |
| 99508 | 12 | < 0.1% |
| 99218 | 181 | |
| 99206 | 181 | |
| 98354 | 299 | |
| 98312 | 118 | 0.1% |
| 98230 | 297 |
feature5
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.236698 |
| Minimum | 11.873034 |
| Maximum | 76.845878 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 11.873034 |
|---|---|
| 5-th percentile | 27.808553 |
| Q1 | 33.367728 |
| median | 37.492299 |
| Q3 | 41.133355 |
| 95-th percentile | 45.48065 |
| Maximum | 76.845878 |
| Range | 64.972844 |
| Interquartile range (IQR) | 7.7656272 |
Descriptive statistics
| Standard deviation | 5.7195164 |
|---|---|
| Coefficient of variation (CV) | 0.15359892 |
| Kurtosis | 2.4208133 |
| Mean | 37.236698 |
| Median Absolute Deviation (MAD) | 3.8585081 |
| Skewness | 0.38352382 |
| Sum | 4997202.1 |
| Variance | 32.712868 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.21343879 | 1 | < 0.1% |
| 38.48606569 | 1 | < 0.1% |
| 37.12118263 | 1 | < 0.1% |
| 37.33541726 | 1 | < 0.1% |
| 36.81711117 | 1 | < 0.1% |
| 40.8536771 | 1 | < 0.1% |
| 37.53080221 | 1 | < 0.1% |
| 35.66482872 | 1 | < 0.1% |
| 37.25700491 | 1 | < 0.1% |
| 37.47805552 | 1 | < 0.1% |
| Other values (134191) | 134191 |
| Value | Count | Frequency (%) |
| 11.87303371 | 1 | |
| 14.89140096 | 1 | |
| 14.95912265 | 1 | |
| 15.07966291 | 1 | |
| 15.11185023 | 1 | |
| 15.67944168 | 1 | |
| 16.01816068 | 1 | |
| 16.02176482 | 1 | |
| 16.16453759 | 1 | |
| 16.40507518 | 1 |
| Value | Count | Frequency (%) |
| 76.8458781 | 1 | |
| 75.47804188 | 1 | |
| 74.71848258 | 1 | |
| 74.59075688 | 1 | |
| 74.3803469 | 1 | |
| 74.17610596 | 1 | |
| 74.08888983 | 1 | |
| 74.06084688 | 1 | |
| 73.95149566 | 1 | |
| 73.80880658 | 1 |
feature6
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -91.838641 |
| Minimum | -173.21991 |
| Maximum | -63.066068 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 134201 |
| Negative (%) | 100.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | -173.21991 |
|---|---|
| 5-th percentile | -121.2964 |
| Q1 | -98.892598 |
| median | -87.142421 |
| Q3 | -79.599335 |
| 95-th percentile | -72.406641 |
| Maximum | -63.066068 |
| Range | 110.15384 |
| Interquartile range (IQR) | 19.293263 |
Descriptive statistics
| Standard deviation | 16.339139 |
|---|---|
| Coefficient of variation (CV) | -0.17791138 |
| Kurtosis | 0.55543038 |
| Mean | -91.838641 |
| Median Absolute Deviation (MAD) | 9.378661 |
| Skewness | -0.95982136 |
| Sum | -12324838 |
| Variance | 266.96747 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -85.2037563 | 1 | < 0.1% |
| -123.2611282 | 1 | < 0.1% |
| -121.7739182 | 1 | < 0.1% |
| -122.494415 | 1 | < 0.1% |
| -120.0465399 | 1 | < 0.1% |
| -117.9350593 | 1 | < 0.1% |
| -118.0754231 | 1 | < 0.1% |
| -120.8780758 | 1 | < 0.1% |
| -121.4645813 | 1 | < 0.1% |
| -119.7140041 | 1 | < 0.1% |
| Other values (134191) | 134191 |
| Value | Count | Frequency (%) |
| -173.2199057 | 1 | |
| -172.6492897 | 1 | |
| -172.0358687 | 1 | |
| -172.0274177 | 1 | |
| -171.9630915 | 1 | |
| -171.1974443 | 1 | |
| -171.0406383 | 1 | |
| -170.6418147 | 1 | |
| -170.5758417 | 1 | |
| -170.5605941 | 1 |
| Value | Count | Frequency (%) |
| -63.06606843 | 1 | |
| -63.42690452 | 1 | |
| -64.92263618 | 1 | |
| -65.04017887 | 1 | |
| -65.1725452 | 1 | |
| -65.34259827 | 1 | |
| -65.35198726 | 1 | |
| -65.35629868 | 1 | |
| -65.37930797 | 1 | |
| -65.41633269 | 1 |
feature7
Real number (ℝ)
| Distinct | 739 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 293853.95 |
| Minimum | 194 |
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 194 |
|---|---|
| 5-th percentile | 2661 |
| Q1 | 16719 |
| median | 62009 |
| Q3 | 247530 |
| 95-th percentile | 1577385 |
| Maximum | 2906700 |
| Range | 2906506 |
| Interquartile range (IQR) | 230811 |
Descriptive statistics
| Standard deviation | 552713.29 |
|---|---|
| Coefficient of variation (CV) | 1.8809115 |
| Kurtosis | 8.524005 |
| Mean | 293853.95 |
| Median Absolute Deviation (MAD) | 54143 |
| Skewness | 2.8610575 |
| Sum | 3.9435494 × 10^10 |
| Variance | 3.0549198 × 10^11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2906700 | 1618 | 1.2% |
| 1595797 | 1345 | 1.0% |
| 1577385 | 1141 | 0.9% |
| 1382480 | 1138 | 0.8% |
| 1312922 | 1092 | 0.8% |
| 2383912 | 1052 | 0.8% |
| 2504700 | 973 | 0.7% |
| 790689 | 917 | 0.7% |
| 910148 | 902 | 0.7% |
| 67952 | 882 | 0.7% |
| Other values (729) | 123141 |
| Value | Count | Frequency (%) |
| 194 | 123 | 0.1% |
| 237 | 233 | |
| 333 | 361 | |
| 392 | 71 | 0.1% |
| 441 | 186 | |
| 456 | 184 | |
| 614 | 68 | 0.1% |
| 631 | 74 | 0.1% |
| 710 | 231 | |
| 769 | 62 | < 0.1% |
| Value | Count | Frequency (%) |
| 2906700 | 1618 | |
| 2680484 | 314 | 0.2% |
| 2504700 | 973 | |
| 2383912 | 1052 | |
| 1737737 | 476 | 0.4% |
| 1595797 | 1345 | |
| 1577385 | 1141 | |
| 1526206 | 434 | 0.3% |
| 1417793 | 366 | 0.3% |
| 1382480 | 1138 |
feature8
Real number (ℝ)
| Distinct | 133688 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.234688 |
| Minimum | 18.798261 |
| Maximum | 71.485302 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 18.798261 |
|---|---|
| 5-th percentile | 28.264083 |
| Q1 | 33.602904 |
| median | 37.544626 |
| Q3 | 40.976075 |
| 95-th percentile | 44.639542 |
| Maximum | 71.485302 |
| Range | 52.687041 |
| Interquartile range (IQR) | 7.373171 |
Descriptive statistics
| Standard deviation | 5.3845784 |
|---|---|
| Coefficient of variation (CV) | 0.14461188 |
| Kurtosis | 3.1425973 |
| Mean | 37.234688 |
| Median Absolute Deviation (MAD) | 3.662394 |
| Skewness | 0.47988769 |
| Sum | 4996932.4 |
| Variance | 28.993685 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.676577 | 2 | < 0.1% |
| 42.226678 | 2 | < 0.1% |
| 42.220395 | 2 | < 0.1% |
| 40.282775 | 2 | < 0.1% |
| 43.874085 | 2 | < 0.1% |
| 40.697807 | 2 | < 0.1% |
| 33.335547 | 2 | < 0.1% |
| 39.954891 | 2 | < 0.1% |
| 42.338957 | 2 | < 0.1% |
| 41.937471 | 2 | < 0.1% |
| Other values (133678) | 134181 |
| Value | Count | Frequency (%) |
| 18.798261 | 1 | |
| 18.823393 | 1 | |
| 18.829194 | 1 | |
| 18.83863 | 1 | |
| 18.850059 | 1 | |
| 18.862375 | 1 | |
| 18.883823 | 1 | |
| 18.894883 | 1 | |
| 18.909904 | 1 | |
| 18.912901 | 1 |
| Value | Count | Frequency (%) |
| 71.485302 | 1 | |
| 71.482581 | 1 | |
| 71.468627 | 1 | |
| 71.457397 | 1 | |
| 71.449817 | 1 | |
| 71.425062 | 1 | |
| 71.415706 | 1 | |
| 71.414967 | 1 | |
| 71.409252 | 1 | |
| 71.392575 | 1 |
feature9
Real number (ℝ)
| Distinct | 133967 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -91.844922 |
| Minimum | -169.01967 |
| Maximum | -69.13386 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 134201 |
| Negative (%) | 100.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | -169.01967 |
|---|---|
| 5-th percentile | -121.44173 |
| Q1 | -98.069477 |
| median | -86.945641 |
| Q3 | -80.010288 |
| 95-th percentile | -73.13566 |
| Maximum | -69.13386 |
| Range | 99.885809 |
| Interquartile range (IQR) | 18.059189 |
Descriptive statistics
| Standard deviation | 16.224433 |
|---|---|
| Coefficient of variation (CV) | -0.1766503 |
| Kurtosis | 0.58171777 |
| Mean | -91.844922 |
| Median Absolute Deviation (MAD) | 9.399738 |
| Skewness | -0.98164646 |
| Sum | -12325680 |
| Variance | 263.23222 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -82.650219 | 2 | < 0.1% |
| -86.99192 | 2 | < 0.1% |
| -81.807377 | 2 | < 0.1% |
| -74.344951 | 2 | < 0.1% |
| -85.165589 | 2 | < 0.1% |
| -81.049725 | 2 | < 0.1% |
| -99.115179 | 2 | < 0.1% |
| -82.307623 | 2 | < 0.1% |
| -122.988165 | 2 | < 0.1% |
| -73.905525 | 2 | < 0.1% |
| Other values (133957) | 134181 |
| Value | Count | Frequency (%) |
| -169.019669 | 1 | |
| -169.019569 | 1 | |
| -168.961024 | 1 | |
| -168.959559 | 1 | |
| -168.956874 | 1 | |
| -168.949138 | 1 | |
| -168.93595 | 1 | |
| -168.898636 | 1 | |
| -168.879543 | 1 | |
| -168.873607 | 1 |
| Value | Count | Frequency (%) |
| -69.13386 | 1 | |
| -69.134307 | 1 | |
| -69.137203 | 1 | |
| -69.137284 | 1 | |
| -69.141322 | 1 | |
| -69.15635 | 1 | |
| -69.157873 | 1 | |
| -69.160535 | 1 | |
| -69.170033 | 1 | |
| -69.173353 | 1 |
feature10
Real number (ℝ)
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.272938 |
| Minimum | 23.447657 |
| Maximum | 97.121303 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 23.447657 |
|---|---|
| 5-th percentile | 26.996285 |
| Q1 | 34.949863 |
| median | 44.951903 |
| Q3 | 58.63852 |
| 95-th percentile | 81.262134 |
| Maximum | 97.121303 |
| Range | 73.673646 |
| Interquartile range (IQR) | 23.688658 |
Descriptive statistics
| Standard deviation | 16.670031 |
|---|---|
| Coefficient of variation (CV) | 0.3453287 |
| Kurtosis | -0.098020166 |
| Mean | 48.272938 |
| Median Absolute Deviation (MAD) | 11.478462 |
| Skewness | 0.77448827 |
| Sum | 6478276.5 |
| Variance | 277.88993 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65.59606218 | 1 | < 0.1% |
| 40.9696196 | 1 | < 0.1% |
| 39.88999032 | 1 | < 0.1% |
| 41.09778834 | 1 | < 0.1% |
| 41.25147157 | 1 | < 0.1% |
| 41.93139608 | 1 | < 0.1% |
| 40.4150176 | 1 | < 0.1% |
| 40.43900379 | 1 | < 0.1% |
| 40.38357223 | 1 | < 0.1% |
| 40.69024 | 1 | < 0.1% |
| Other values (134191) | 134191 |
| Value | Count | Frequency (%) |
| 23.44765672 | 1 | |
| 23.76904238 | 1 | |
| 23.86962919 | 1 | |
| 23.91537752 | 1 | |
| 23.94502762 | 1 | |
| 23.96328287 | 1 | |
| 23.99754265 | 1 | |
| 24.02708902 | 1 | |
| 24.03170502 | 1 | |
| 24.07089988 | 1 |
| Value | Count | Frequency (%) |
| 97.12130289 | 1 | |
| 96.99612844 | 1 | |
| 96.89987523 | 1 | |
| 96.79043027 | 1 | |
| 96.78070484 | 1 | |
| 96.71757679 | 1 | |
| 96.70224064 | 1 | |
| 96.68846318 | 1 | |
| 96.68206388 | 1 | |
| 96.60726724 | 1 |
feature11
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.727856 |
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 4866 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 7.0487407 |
|---|---|
| Coefficient of variation (CV) | 0.55380424 |
| Kurtosis | -1.1605634 |
| Mean | 12.727856 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.24495943 |
| Sum | 1708091 |
| Variance | 49.684745 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22 | 8377 | 6.2% |
| 23 | 8334 | 6.2% |
| 20 | 6433 | 4.8% |
| 13 | 6418 | 4.8% |
| 19 | 6388 | 4.8% |
| 17 | 6349 | 4.7% |
| 12 | 6319 | 4.7% |
| 21 | 6297 | 4.7% |
| 18 | 6294 | 4.7% |
| 16 | 6292 | 4.7% |
| Other values (14) | 66700 |
| Value | Count | Frequency (%) |
| 0 | 4866 | |
| 1 | 4908 | |
| 2 | 5073 | |
| 3 | 5024 | |
| 4 | 4182 | |
| 5 | 4375 | |
| 6 | 4337 | |
| 7 | 4356 | |
| 8 | 4274 | |
| 9 | 4291 |
| Value | Count | Frequency (%) |
| 23 | 8334 | |
| 22 | 8377 | |
| 21 | 6297 | |
| 20 | 6433 | |
| 19 | 6388 | |
| 18 | 6294 | |
| 17 | 6349 | |
| 16 | 6292 | |
| 15 | 6254 | |
| 14 | 6247 |
feature12
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4779025 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 13139 |
| Zeros (%) | 9.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.9938854 |
|---|---|
| Coefficient of variation (CV) | 0.57330112 |
| Kurtosis | -1.1908398 |
| Mean | 3.4779025 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.296005 |
| Sum | 466738 |
| Variance | 3.9755791 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 27584 | 20.6% |
| 5 | 24375 | 18.2% |
| 4 | 20711 | 15.4% |
| 2 | 17867 | 13.3% |
| 1 | 15397 | 11.5% |
| 3 | 15128 | 11.3% |
| 0 | 13139 | 9.8% |
| Value | Count | Frequency (%) |
| 0 | 13139 | |
| 1 | 15397 | |
| 2 | 17867 | |
| 3 | 15128 | |
| 4 | 20711 | |
| 5 | 24375 | |
| 6 | 27584 |
| Value | Count | Frequency (%) |
| 6 | 27584 | |
| 5 | 24375 | |
| 4 | 20711 | |
| 3 | 15128 | |
| 2 | 17867 | |
| 1 | 15397 | |
| 0 | 13139 |
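Because feature12 has only seven distinct values, its summary statistics can be rebuilt from the value counts alone. The sketch below recomputes the row count, sum, mean, and per-value shares from the counts in the tables above. The variable names are illustrative.

```python
# Value counts copied from the feature12 tables in this report.
counts = {0: 13139, 1: 15397, 2: 17867, 3: 15128, 4: 20711, 5: 24375, 6: 27584}

n_rows = sum(counts.values())                                  # reported observations: 134201
total = sum(value * count for value, count in counts.items())  # reported Sum: 466738
mean = total / n_rows                                          # reported Mean: 3.4779025
shares = {value: 100 * count / n_rows for value, count in counts.items()}

print(n_rows, total, round(mean, 7))
for value, share in sorted(shares.items()):
    print(f"{value}: {share:.1f}%")
```

The share computed for value 0 can be compared with the 9.8% given in the Zeros alert for this variable.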
feature13
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 134201 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 69230 | 51.6% |
| 2 | 62445 | 46.5% |
| 3 | 2526 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 69230 | |
| 2 | 62445 | |
| 3 | 2526 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 134201 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 69230 | |
| 2 | 62445 | |
| 3 | 2526 | 1.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 134201 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 69230 | |
| 2 | 62445 | |
| 3 | 2526 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 69230 | |
| 2 | 62445 | |
| 3 | 2526 | 1.9% |
label
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.0 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 134201 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 125704 | 93.7% |
| 1 | 8497 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 125704 | |
| 1 | 8497 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 134201 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 125704 | |
| 1 | 8497 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 134201 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 125704 | |
| 1 | 8497 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 125704 | |
| 1 | 8497 | 6.3% |
feature14
Real number (ℝ)
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4960607 |
| Minimum | -1.0035501 |
| Maximum | 16.812962 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 18 |
| Negative (%) | < 0.1% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | -1.0035501 |
|---|---|
| 5-th percentile | 4.1926212 |
| Q1 | 6.1423066 |
| median | 7.498909 |
| Q3 | 8.8490464 |
| 95-th percentile | 10.788789 |
| Maximum | 16.812962 |
| Range | 17.816512 |
| Interquartile range (IQR) | 2.7067398 |
Descriptive statistics
| Standard deviation | 2.0030492 |
|---|---|
| Coefficient of variation (CV) | 0.26721358 |
| Kurtosis | 0.0035213268 |
| Mean | 7.4960607 |
| Median Absolute Deviation (MAD) | 1.3532393 |
| Skewness | -0.0080284672 |
| Sum | 1005978.8 |
| Variance | 4.0122061 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.017864755 | 1 | < 0.1% |
| 9.168051523 | 1 | < 0.1% |
| 7.531186299 | 1 | < 0.1% |
| 7.935373383 | 1 | < 0.1% |
| 7.987994204 | 1 | < 0.1% |
| 10.32536268 | 1 | < 0.1% |
| 7.606166055 | 1 | < 0.1% |
| 7.289974592 | 1 | < 0.1% |
| 10.15091605 | 1 | < 0.1% |
| 5.949181714 | 1 | < 0.1% |
| Other values (134191) | 134191 |
| Value | Count | Frequency (%) |
| -1.003550085 | 1 | |
| -0.9290672109 | 1 | |
| -0.847267109 | 1 | |
| -0.6487660799 | 1 | |
| -0.6311311885 | 1 | |
| -0.5234251099 | 1 | |
| -0.4840739771 | 1 | |
| -0.4366576635 | 1 | |
| -0.4186388095 | 1 | |
| -0.4177912709 | 1 |
| Value | Count | Frequency (%) |
| 16.81296174 | 1 | |
| 15.72474357 | 1 | |
| 15.71535894 | 1 | |
| 15.67344242 | 1 | |
| 15.60196316 | 1 | |
| 15.5996583 | 1 | |
| 15.49336001 | 1 | |
| 15.27992714 | 1 | |
| 15.26951288 | 1 | |
| 15.26006981 | 1 |
feature15
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.53177357 |
| Minimum | -0.34274793 |
| Maximum | 1.7033392 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 767 |
| Negative (%) | 0.6% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | -0.34274793 |
|---|---|
| 5-th percentile | 0.17623962 |
| Q1 | 0.37577811 |
| median | 0.51705945 |
| Q3 | 0.66675556 |
| 95-th percentile | 0.94798904 |
| Maximum | 1.7033392 |
| Range | 2.0460872 |
| Interquartile range (IQR) | 0.29097745 |
Descriptive statistics
| Standard deviation | 0.23425001 |
|---|---|
| Coefficient of variation (CV) | 0.44050705 |
| Kurtosis | 0.77024134 |
| Mean | 0.53177357 |
| Median Absolute Deviation (MAD) | 0.14531795 |
| Skewness | 0.50322292 |
| Sum | 71364.545 |
| Variance | 0.054873066 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.028822258 | 1 | < 0.1% |
| 0.921769628 | 1 | < 0.1% |
| 0.5584495287 | 1 | < 0.1% |
| 0.6743813376 | 1 | < 0.1% |
| 0.390424133 | 1 | < 0.1% |
| 0.6733890499 | 1 | < 0.1% |
| 0.672163588 | 1 | < 0.1% |
| 0.4617404011 | 1 | < 0.1% |
| 0.4919832631 | 1 | < 0.1% |
| 0.6771587034 | 1 | < 0.1% |
| Other values (134191) | 134191 |
| Value | Count | Frequency (%) |
| -0.3427479347 | 1 | |
| -0.3353456445 | 1 | |
| -0.3213764401 | 1 | |
| -0.3058587155 | 1 | |
| -0.2967268268 | 1 | |
| -0.2933620709 | 1 | |
| -0.2920209356 | 1 | |
| -0.2884168614 | 1 | |
| -0.2832487307 | 1 | |
| -0.2805849503 | 1 |
| Value | Count | Frequency (%) |
| 1.703339221 | 1 | |
| 1.666962389 | 1 | |
| 1.652025332 | 1 | |
| 1.651679277 | 1 | |
| 1.647956543 | 1 | |
| 1.647864592 | 1 | |
| 1.627449452 | 1 | |
| 1.625455378 | 1 | |
| 1.619805196 | 1 | |
| 1.615795848 | 1 |
feature16
Real number (ℝ)
| Distinct | 134201 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.236701 |
| Minimum | 42.803103 |
| Maximum | 88.834367 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 MiB |
Quantile statistics
| Minimum | 42.803103 |
|---|---|
| 5-th percentile | 57.489273 |
| Q1 | 62.775925 |
| median | 66.31578 |
| Q3 | 69.783914 |
| 95-th percentile | 74.721526 |
| Maximum | 88.834367 |
| Range | 46.031265 |
| Interquartile range (IQR) | 7.0079889 |
Descriptive statistics
| Standard deviation | 5.2533946 |
|---|---|
| Coefficient of variation (CV) | 0.079312444 |
| Kurtosis | 0.09721404 |
| Mean | 66.236701 |
| Median Absolute Deviation (MAD) | 3.5018992 |
| Skewness | -0.093354893 |
| Sum | 8889031.5 |
| Variance | 27.598155 |
| Monotonicity | Not monotonic |
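Several of the feature16 statistics above are derived from one another: range from minimum and maximum, IQR from the quartiles, CV from standard deviation and mean, variance from standard deviation, and sum from mean and row count. A minimal sketch of that arithmetic follows, using only values copied from the tables and the 134201 observations stated in the dataset overview. The tolerance is an assumption chosen to absorb the report's rounding.

```python
import math

# Values copied from the feature16 tables above.
N_OBSERVATIONS = 134201  # "Number of observations" in the dataset overview
stats = {
    "minimum": 42.803103,
    "q1": 62.775925,
    "q3": 69.783914,
    "maximum": 88.834367,
    "range": 46.031265,
    "iqr": 7.0079889,
    "std": 5.2533946,
    "cv": 0.079312444,
    "mean": 66.236701,
    "sum": 8889031.5,
    "variance": 27.598155,
}

# Each reported statistic paired with the value derived from other rows.
checks = {
    "range": stats["maximum"] - stats["minimum"],
    "iqr": stats["q3"] - stats["q1"],
    "cv": stats["std"] / stats["mean"],
    "variance": stats["std"] ** 2,
    "sum": stats["mean"] * N_OBSERVATIONS,
}

for name, derived in checks.items():
    reported = stats[name]
    # The report rounds to about 8 significant digits, so compare loosely.
    ok = math.isclose(reported, derived, rel_tol=1e-6)
    print(f"{name:<8} reported={reported:<14} derived={derived:<18.8f} consistent={ok}")
```

All five checks pass at this tolerance, which suggests the table was extracted without digit-level damage.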
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 58.91113205 | 1 | < 0.1% |
| 67.5992062 | 1 | < 0.1% |
| 73.33554323 | 1 | < 0.1% |
| 69.1666375 | 1 | < 0.1% |
| 68.01379208 | 1 | < 0.1% |
| 67.28325164 | 1 | < 0.1% |
| 65.70653351 | 1 | < 0.1% |
| 68.98287813 | 1 | < 0.1% |
| 66.3314471 | 1 | < 0.1% |
| 70.07217808 | 1 | < 0.1% |
| Other values (134191) | 134191 | > 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 42.80310253 | 1 | < 0.1% |
| 44.15271061 | 1 | < 0.1% |
| 44.21972587 | 1 | < 0.1% |
| 44.4247799 | 1 | < 0.1% |
| 44.59322547 | 1 | < 0.1% |
| 44.68902034 | 1 | < 0.1% |
| 44.69719157 | 1 | < 0.1% |
| 44.70530797 | 1 | < 0.1% |
| 44.88639665 | 1 | < 0.1% |
| 45.28013744 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 88.8343673 | 1 | < 0.1% |
| 88.59156936 | 1 | < 0.1% |
| 88.10419538 | 1 | < 0.1% |
| 87.58466842 | 1 | < 0.1% |
| 87.48454737 | 1 | < 0.1% |
| 87.43937296 | 1 | < 0.1% |
| 87.28445786 | 1 | < 0.1% |
| 86.9780624 | 1 | < 0.1% |
| 86.78564545 | 1 | < 0.1% |
| 86.53818716 | 1 | < 0.1% |
Correlations
| | feature3 | feature4 | feature5 | feature6 | feature7 | feature8 | feature9 | feature10 | feature11 | feature12 | feature14 | feature15 | feature16 | feature2 | feature13 | label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| feature3 | 1.000 | 0.003 | -0.006 | -0.004 | 0.002 | -0.005 | -0.004 | 0.018 | -0.096 | -0.016 | 0.004 | 0.108 | -0.073 | 0.005 | 0.014 | 0.000 |
| feature4 | 0.003 | 1.000 | -0.206 | -0.964 | 0.119 | -0.228 | -0.976 | 0.016 | 0.001 | -0.002 | -0.001 | -0.003 | 0.003 | 0.007 | 0.000 | 0.020 |
| feature5 | -0.006 | -0.206 | 1.000 | 0.207 | -0.173 | 0.917 | 0.212 | 0.035 | -0.007 | 0.002 | 0.002 | -0.003 | -0.000 | 0.005 | 0.003 | 0.007 |
| feature6 | -0.004 | -0.964 | 0.207 | 1.000 | -0.099 | 0.230 | 0.983 | -0.024 | 0.001 | 0.002 | 0.002 | 0.002 | -0.002 | 0.006 | 0.000 | 0.007 |
| feature7 | 0.002 | 0.119 | -0.173 | -0.099 | 1.000 | -0.188 | -0.099 | -0.013 | 0.000 | 0.003 | -0.002 | -0.001 | 0.004 | 0.007 | 0.002 | 0.007 |
| feature8 | -0.005 | -0.228 | 0.917 | 0.230 | -0.188 | 1.000 | 0.236 | 0.036 | -0.008 | 0.002 | -0.000 | -0.001 | -0.001 | 0.007 | 0.000 | 0.012 |
| feature9 | -0.004 | -0.976 | 0.212 | 0.983 | -0.099 | 0.236 | 1.000 | -0.024 | 0.001 | 0.003 | 0.001 | 0.002 | -0.003 | 0.006 | 0.000 | 0.000 |
| feature10 | 0.018 | 0.016 | 0.035 | -0.024 | -0.013 | 0.036 | -0.024 | 1.000 | -0.156 | 0.007 | -0.001 | 0.032 | -0.021 | 0.041 | 0.006 | 0.084 |
| feature11 | -0.096 | 0.001 | -0.007 | 0.001 | 0.000 | -0.008 | 0.001 | -0.156 | 1.000 | -0.005 | -0.000 | 0.020 | -0.013 | 0.266 | 0.457 | 0.291 |
| feature12 | -0.016 | -0.002 | 0.002 | 0.002 | 0.003 | 0.002 | 0.003 | 0.007 | -0.005 | 1.000 | 0.003 | -0.029 | 0.021 | 0.155 | 0.253 | 0.070 |
| feature14 | 0.004 | -0.001 | 0.002 | 0.002 | -0.002 | -0.000 | 0.001 | -0.001 | -0.000 | 0.003 | 1.000 | 0.007 | -0.003 | 0.003 | 0.007 | 0.000 |
| feature15 | 0.108 | -0.003 | -0.003 | 0.002 | -0.001 | -0.001 | 0.002 | 0.032 | 0.020 | -0.029 | 0.007 | 1.000 | -0.105 | 0.054 | 0.022 | 0.704 |
| feature16 | -0.073 | 0.003 | -0.000 | -0.002 | 0.004 | -0.001 | -0.003 | -0.021 | -0.013 | 0.021 | -0.003 | -0.105 | 1.000 | 0.028 | 0.010 | 0.366 |
| feature2 | 0.005 | 0.007 | 0.005 | 0.006 | 0.007 | 0.007 | 0.006 | 0.041 | 0.266 | 0.155 | 0.003 | 0.054 | 0.028 | 1.000 | 0.838 | 0.226 |
| feature13 | 0.014 | 0.000 | 0.003 | 0.000 | 0.002 | 0.000 | 0.000 | 0.006 | 0.457 | 0.253 | 0.007 | 0.022 | 0.010 | 0.838 | 1.000 | 0.042 |
| label | 0.000 | 0.020 | 0.007 | 0.007 | 0.007 | 0.012 | 0.000 | 0.084 | 0.291 | 0.070 | 0.000 | 0.704 | 0.366 | 0.226 | 0.042 | 1.000 |
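The correlation table above is easier to read once the strong pairs are pulled out. The sketch below does this for a subset of coefficients copied from the table. The 0.5 cut-off is an assumption for illustration, since this section of the report does not state the threshold behind its "High correlation" alerts, and the helper name `high_pairs` is hypothetical.

```python
# Subset of pairwise coefficients copied from the correlation table above.
correlations = {
    ("feature4", "feature6"): -0.964,
    ("feature4", "feature9"): -0.976,
    ("feature6", "feature9"): 0.983,
    ("feature5", "feature8"): 0.917,
    ("feature2", "feature13"): 0.838,
    ("feature15", "label"): 0.704,
    ("feature11", "feature13"): 0.457,
    ("feature16", "label"): 0.366,
    ("feature11", "label"): 0.291,
}

# Assumed cut-off for illustration; the report does not state the threshold.
THRESHOLD = 0.5


def high_pairs(table, threshold):
    """Return (pair, coefficient) entries with |coefficient| >= threshold, strongest first."""
    flagged = [(pair, r) for pair, r in table.items() if abs(r) >= threshold]
    return sorted(flagged, key=lambda item: abs(item[1]), reverse=True)


for (a, b), r in high_pairs(correlations, THRESHOLD):
    print(f"{a} ~ {b}: {r:+.3f}")
```

With this cut-off the six flagged pairs are the same ones named in the "High correlation" alerts at the top of the report, while pairs such as feature11 and feature13 (0.457) fall below it.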
First rows
| | feature1 | feature2 | feature3 | feature4 | feature5 | feature6 | feature7 | feature8 | feature9 | feature10 | feature11 | feature12 | feature13 | label | feature14 | feature15 | feature16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Site engineer | grocery_pos | 8.60 | 48230 | 40.213439 | -85.203756 | 47583 | 42.508293 | -83.168004 | 65.596062 | 3 | 5 | 1 | 1 | 8.017865 | 1.028822 | 58.911132 |
| 1 | Site engineer | gas_transport | 316.84 | 48230 | 44.379391 | -82.859721 | 47583 | 42.661838 | -81.966510 | 64.728795 | 6 | 5 | 1 | 1 | 11.768568 | 1.106217 | 64.431017 |
| 2 | Site engineer | grocery_pos | 294.89 | 48230 | 42.950657 | -84.935542 | 47583 | 42.580470 | -82.408529 | 65.434606 | 3 | 5 | 1 | 1 | 7.996359 | 0.899881 | 57.545348 |
| 3 | Site engineer | shopping_net | 831.08 | 48230 | 39.372111 | -84.893973 | 47583 | 41.948688 | -83.919881 | 64.990422 | 23 | 6 | 1 | 1 | 8.767720 | 1.062966 | 62.681169 |
| 4 | Site engineer | health_fitness | 1063.84 | 48230 | 41.227499 | -83.228392 | 47583 | 41.544743 | -82.123365 | 65.316083 | 23 | 6 | 1 | 1 | 8.816222 | 0.722446 | 63.084486 |
| 5 | Site engineer | shopping_net | 968.45 | 48230 | 45.687573 | -83.143026 | 47583 | 43.148631 | -82.298534 | 64.868803 | 23 | 6 | 1 | 1 | 6.806540 | 1.063795 | 60.675761 |
| 6 | Site engineer | shopping_pos | 981.26 | 48230 | 37.531342 | -78.698564 | 47583 | 42.777359 | -82.995552 | 65.553237 | 18 | 6 | 1 | 1 | 8.913152 | 0.868734 | 64.374301 |
| 7 | Site engineer | shopping_net | 1007.67 | 48230 | 41.526875 | -81.503292 | 47583 | 41.640156 | -82.070042 | 64.349109 | 13 | 6 | 1 | 1 | 5.153043 | 0.908907 | 62.226104 |
| 8 | Site engineer | misc_net | 975.26 | 48230 | 44.916573 | -85.266544 | 47583 | 41.877041 | -82.570052 | 63.353419 | 22 | 5 | 1 | 1 | 6.753615 | 1.021766 | 67.357675 |
| 9 | Site engineer | shopping_net | 1004.32 | 48230 | 40.651219 | -82.094873 | 47583 | 41.945489 | -83.268167 | 64.515283 | 22 | 6 | 1 | 1 | 9.364855 | 1.059914 | 60.399823 |
Last rows
| | feature1 | feature2 | feature3 | feature4 | feature5 | feature6 | feature7 | feature8 | feature9 | feature10 | feature11 | feature12 | feature13 | label | feature14 | feature15 | feature16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 134191 | Minerals surveyor | shopping_net | 45.06 | 32210 | 30.274085 | -81.969416 | 847415 | 31.081801 | -82.260124 | 30.655925 | 16 | 6 | 1 | 0 | 5.854642 | 0.694827 | 56.548753 |
| 134192 | Minerals surveyor | entertainment | 35.46 | 32210 | 31.621497 | -77.741066 | 847415 | 29.816261 | -80.751488 | 31.377807 | 19 | 2 | 2 | 0 | 6.728941 | 0.536597 | 64.663838 |
| 134193 | Minerals surveyor | health_fitness | 104.28 | 32210 | 32.486831 | -80.227313 | 847415 | 30.775710 | -82.500330 | 31.190823 | 23 | 5 | 2 | 0 | 5.028902 | 0.452881 | 63.044666 |
| 134194 | Minerals surveyor | kids_pets | 25.12 | 32210 | 30.882394 | -81.639615 | 847415 | 29.376789 | -80.890938 | 30.733635 | 14 | 3 | 2 | 0 | 7.309583 | 0.344593 | 71.480156 |
| 134195 | Minerals surveyor | home | 179.24 | 32210 | 30.052630 | -82.470667 | 847415 | 30.045514 | -82.358249 | 31.273273 | 19 | 2 | 2 | 0 | 8.491107 | 0.300137 | 64.687930 |
| 134196 | Minerals surveyor | health_fitness | 132.98 | 32210 | 30.570251 | -80.001638 | 847415 | 29.725058 | -81.319645 | 31.785767 | 23 | 4 | 2 | 0 | 6.930241 | 0.585582 | 61.754724 |
| 134197 | Minerals surveyor | health_fitness | 2.19 | 32210 | 29.150371 | -81.719344 | 847415 | 31.233001 | -81.786202 | 30.800002 | 23 | 3 | 2 | 0 | 7.432464 | 0.424970 | 61.681467 |
| 134198 | Minerals surveyor | kids_pets | 3.16 | 32210 | 34.212880 | -80.634963 | 847415 | 29.874284 | -81.624591 | 30.641819 | 23 | 6 | 2 | 0 | 7.640235 | 0.288538 | 65.003013 |
| 134199 | Minerals surveyor | entertainment | 7.12 | 32210 | 28.515650 | -80.139073 | 847415 | 29.502540 | -82.612350 | 30.809930 | 20 | 5 | 2 | 0 | 4.193106 | 0.489005 | 66.160873 |
| 134200 | Minerals surveyor | entertainment | 7.51 | 32210 | 31.842962 | -80.307771 | 847415 | 29.506603 | -80.806227 | 31.129042 | 21 | 2 | 2 | 0 | 3.082634 | 0.689734 | 67.800253 |